

#### **RE-ENGINEERING COMPUTING WITH NEURO-MIMETIC DEVICES, CIRCUITS, AND ALGORITHMS**

#### Kaushik Roy

Abhronil Sengupta, Gopal Srinivasan, Aayush Ankit, Priya Panda, Xuanyao Fong, Deliang Fan, Jason Allred

> School of Electrical & Computer Engineering Purdue University

Center for Spintronics, National Science Foundation, DARPA, Vannevar Bush Fellowship, Office of Naval Research, Semiconductor Research Corporation, Intel

#### HUMAN VS. MACHINE CHRONICLES

2016



Google AlphaGo vs. Lee Sedol (1920 CPUs, 280 GPUs) ~300000W



1997



IBM Deep Blue vs. Kasparov

IBM RS/6000 32-node server (Power2 + 8 dedicated chips) ~15000 W

#### 2011



IBM Watson vs. Brad Ritter &Ken Jennings90 Power 750 Express servers

(4 8-core CPUs) ~20000W

# WHAT ARE THE ISSUES WITH DLN?

- Requires massive amount of training data
  - Learn with much less data
- Supervised learning
  - Need for higher bio-fidelity
- Incremental/adaptive learning is difficult catastrophic forgetting
  - Life-long learning
- Huge power consumption
  - Event driven evaluation can help; Approximate hardware
- Well-suited for image, speech, text recognition..
  - Need for cognitive systems to perform larger range of functions not just sensory processing, but also reasoning and decision making
- Can neuro-mimetic devices help?
  - von-Neumann architecture not suitable
    - Need for in-memory-computing, efficient neurons and synapses

#### **Neuromorphic Computing: An Overview**



STUDENTS/Post-doc: Abhronil Sengupta, Priyadarshini Panda, Syed Shakib, Gopal Srinivasan, Chamika Liyanagendra, Bing Han, Jason Allred, Parami Wijesinghe, Ayush Ankit, Maryam Parsa, Shuhan Chen and others...

## **Cognitive Computing**



#### Device/Circuit/Algorithm Co-Design: Spin/ANN



for system-level simulations of neuromorphic systems System Level Solution V

#### **Bottom-Up Perspective**



Investigate device physics to mimic "neuron/ synapse" functionalities

Calibration of device models with experiments





# **SPIKING NEURAL NETWORKS**

# **Spiking Neuron Dynamics**



Postsynaptic neuron spikes when membrane potential crosses a certain threshold and gets reset

#### **Spike Timing Dependent Plasticity: Example**



Bi and Poo, The Journal of Neuroscience, December 15, 1998.

## **Artificial Neural Networks: Simple Model**



Spintronic neurons operating at ultra-low terminal voltages interfaced with resistive synapses lead to energy-efficient ANNs



# DEVICES: NEURONS, SYNAPSES, IN-MEMORY COMPUTING

#### Building Primitives: Memory, Neurons, Synapses



#### Spin Transfer Torque Induced Domain Wall Motion

- Multi-domain magnets consists of a domain wall (DW) separating regions with opposite magnetic polarizations
- Domain wall can be moved in the direction of electron flow
- MTJ resistance varies with domain wall position
- Decoupled "write" and "read" current paths
- Low current induced Ne'el domain wall motion can be achieved by spin-orbit torque generated by spin-Hall effect from a heavy metal underlayer in presence of DMI



Universal device: Suitable for memory, neuron, synapse, interconnects

## **Core Computing Blocks**



# **SHE Induced DW Motion: Neuron**



**Non-spiking Neuron** 

**IF Spiking Neuron** 

- Three terminal spintronic device acting as a neuron (with different degrees of bio-fidelity) and synapse
- The neuron is interfaced with the "axon" circuit to generate a corresponding analog output current with variation in the input current / Integrate-Fire "spiking" neuron can be implemented using a similar device structure where the MTJ is located at the extreme edge of the FM.
- Synapse, acting as the memory element, encodes the weight in its conductance value which is determined by the domain wall position

Sengupta, Roy, TBioCAS'16; Sengupta, Roy, TCAS'16

## **Core Computing Blocks**



# **Self-learning in Spiking Neural Networks**



#### **Spike-Timing Dependent Plasticity**

- Spintronic synapse in spiking neural networks exhibits spike timing dependent plasticity observed in biological synapses
- Programming current flowing through heavy metal varies in a similar nature as STDP curve
- Decoupled spike transmission and programming current paths assist online learning
- 48fJ energy consumption per synaptic event which is ~10-100x lower in comparison to emerging devices like PCM

## **Network for Pattern Recognition**



Pattern recognition performed in a network of excitatory spiking neurons in presence of lateral inhibition and homeostasis

#### **Self-Learning in Spiking Neural Network**



# Energy Comparison – Spintronic Engine & CMOS Baseline



 Spintronic architecture achieves energy improvements of 204X – 2759X with respect to SNeuE (CMOS baseline) across all benchmarks.

- Spintronic crossbar enabled in-memory processing enables to overcome the memory domination/bottlenecks in the CMOS engine.
- Spintronic neurons interfaced with spintronic crossbar (SCA) allows energy-efficient innerproduct computations.



# STOCHASTIC NEURAL NETWORKS: SHE BASED MTJ



# **Spiking Neuron Membrane Potential**



The leaky fire and integrate can be approximated by an MTJ – the magnetization dynamics mimics the leaky fire and integrate operation Sengupta, Roy, Scientific Reports'16

#### **Interconnected Crossbars for NNs**



#### SHE Based MTJ: Stochastic Neuron & Synapse

- Exploit the stochastic switching behavior of a mono-domain MTJ in the presence of thermal noise – sigmoidal function
- Stochastic sigmoidal neuron
- Replaces multi-bit synapses with a stochastic single bit (binary) synapse.



# SHE-Based Switching: Experiments



- Anomalous Hall effect (AHE) in FM layer: Hall resistances change abruptly when the magnetization switches in FM.
- Ta(5nm)/CoFeB(1.3nm)/MgO(1.5nm)/Ta(5 nm) (bottom to top) Hall bar structure; CoFeB shows PMA.
- In-plane field  $H_L$  applied
- A current pulse group with changeable magnitude is sent through I+/-. Then resistance change is measured across V+/-



# **Stochastic Binary Synapse**

- Synaptic strength is updated based on the temporal correlation between pre- and post-spike trains
- Synaptic learning is embedded in the switching probability of binary synapses
- Switch the MTJ based on spike timing by passing the required write current





#### Spiking Neuromorphic Architecture with LT-ST Synapses



- Crossbar arrangement of the LT-ST synapses with CMOS neurons
  - The significant LT synapses are driven by a higher read voltage

- Network of LT-ST synapses provides 5% improvement in the classification accuracy over one-bit synapses
- Under iso-accuracy conditions, the LT-ST synapses offer a 2X reduction in the synaptic write energy

#### **From Devices to Circuits and Systems**





C-SPIN Annual Review, September 27-28, 2017

C-SP<sup>†</sup>N

#### **Simulation Framework**



## Conclusions

- Other than memory, STT devices also show promise for a class of computing models such as "brain-inspired computing", stochastic computing
- STT-devices as "neurons" and "synapses" for both ANN, SNN show the possibility of large improvement in energy compared to CMOS implementation